Hamiltonian mechanics

See also classical Hamiltonian system.

The states of a system of Classical Mechanics are determined by generalized variables and their derivatives. In other words, we work in the tangent bundle of the configuration space, or the jet bundle of order 1. That is part of the formulation of Lagrangian Mechanics. The Legendre transform allows us to move everything to the cotangent bundle (phase space). The states of the system are determined by the original variables and generalized momenta. It may seem cumbersome, but it has advantages in geometric interpretation. The systems (dynamical systems) with this origin are called classical Hamiltonian systems. Some of them are called integrable systems. Important example: harmonic oscillator.

The Hamiltonian mechanics formalism can be summarized as follows (modified from "Review of CM and QM by Terence Tao:

1. The physical system has a phase space $\Omega$ of states $\vec{x}$ (often parameterized by position variables $q$ and momentum variables $p$), which mathematically corresponds to a symplectic manifold with a symplectic form $\omega$ (e.g., $\omega = dp \wedge dq$ for position and momentum coordinates).

2. The complete state of the system at any time $t$ is represented by a point $\vec{x}(t)$ in the phase space $\Omega$.

3. Every physical observable $A$ (e.g., energy, momentum, position) is associated with a function $A$ that maps the phase space $\Omega$ to the range of the observable (e.g., for real observables, $A$ maps $\Omega$ to $\mathbb{R}$). Measuring the observable $A$ at time $t$ yields the measurement $A(\vec{x}(t))$.

4. The Hamiltonian $H: \Omega \rightarrow \mathbb{R}$ is a special observable that governs the evolution of the state $\vec{x}(t)$ over time through Hamiltonian equations of motion. In terms of position and momentum coordinates $\vec{x}(t) = (q_i(t), p_i(t))_{i=1}^n$, these equations are given by:

$$ \frac{\partial p_i}{\partial t} = -\frac{\partial H}{\partial q_i}, \quad \frac{\partial q_i}{\partial t} = \frac{\partial H}{\partial p_i}. $$

More abstractly, using the symplectic form $\omega$, the equations of motion can be written as:

$$ \frac{\partial \vec{x}(t)}{\partial t} = -\nabla_\omega H(\vec{x}(t)), \quad (2) $$

where $\nabla_\omega H$ is the symplectic gradient of $H$.

Hamilton's equations of motion can also be expressed in a dual form using observables $A$ as Poisson's equations of motion:

$$ \frac{\partial A(\vec{x}(t))}{\partial t} = -\{H, A\}(\vec{x}(t)), $$

where $\{H,A\} = \nabla_\omega H \cdot \nabla A$ is the induced Poisson bracket (here we consider an arbitrary Riemannian metric defined on $\Omega$ in order to define $\nabla A$ and the dot product. See relation symplectic form and Riemannian metric). In a more abstract form, Poisson's equation can be written as:

$$ \frac{\partial A}{\partial t} = -\{H,A\}, \quad (3) $$

where $\{H,A\}$ represents the Poisson bracket.

In the formalism above, we assume the system is in a pure state at each time $t$, occupying a single point $\vec{x}(t)$ in phase space. However, mixed states can also be considered, where the state of the system at time $t$ is described by a probability distribution $\rho(t,\vec{x})\, d\vec{x}$ on the phase space. Measuring an observable $A$ at time $t$ becomes a random variable, and its expectation $\langle A \rangle$ is given by:

$$ \langle A \rangle(t) = \int_\Omega A(\vec{x}) \rho(t,\vec{x})\, d\vec{x}, \quad (4) $$

The equation of motion for a mixed state $\rho$ is given by the advection equation:

$$ \frac{\partial \rho}{\partial t} = \text{div}(\rho \nabla_\omega H)$$

using the same vector field $-\nabla_\omega H$ as in equation (2). This equation can also be derived from equations (3), (4), and a duality argument.

Pure states can be seen as a special case of mixed states, where the probability distribution $\rho(t,\vec{x})\, d\vec{x}$ is a Dirac delta $\delta_{\vec{x}(t)}(\vec{x})$. Mixed states can be thought of as continuous averages of pure states, or equivalently, the space of mixed states is the convex hull of the space of pure states.

Consider a system of 2 particles with a joint phase space $\Omega = \Omega_1 \times \Omega_2$, where $\Omega_1$ and $\Omega_2$ are the individual one-particle phase spaces. A pure joint state is represented by a point $x = (\vec{x}_1, \vec{x}_2)$ in $\Omega$, where $\vec{x}_1$ and $\vec{x}_2$ represent the states of the first and second particles, respectively. If the joint Hamiltonian $H: \Omega \rightarrow \mathbb{R}$ splits as:

$$ H(\vec{x}_1, \vec{x}_2) = H_1(\vec{x}_1) + H_2(\vec{x}_2), $$

then the equations of motion for the first and second particles are completely decoupled, with no interactions between them. However, in practice, the joint Hamiltonian contains coupling terms between $\vec{x}_1$ and $\vec{x}_2$ that prevent total decoupling. For instance, the joint Hamiltonian may be:

$$ H(\vec{x}_1, \vec{x}_2) = \frac{|p_1|^2}{2m_1} + \frac{|p_2|^2}{2m_2} + V(q_1 - q_2), $$

where $\vec{x}_1 = (q_1, p_1)$ and $\vec{x}_2 = (q_2, p_2)$ are position and momentum coordinates, $m_1$ and $m_2$ are mass constants, and $V(q_1 - q_2)$ represents the interaction potential depending on the spatial separation $q_1 - q_2$ between the particles.

Similarly, a mixed joint state is a joint probability distribution $\rho(\vec{x}_1, \vec{x}_2)\, d\vec{x}_1\, d\vec{x}_2$ on the product state space. To obtain the (mixed) state of an individual particle, we consider marginal distributions such as:

$$ \rho_1(\vec{x}_1) = \int_{\Omega_2} \rho(\vec{x}_1, \vec{x}_2)\, d\vec{x}_2 $$

for the first particle or

$$ \rho_2(\vec{x}_2) = \int_{\Omega_1} \rho(\vec{x}_1, \vec{x}_2)\, d\vec{x}_1 $$

for the second particle. For $N$-particle systems, if the joint distribution of $N$ distinct particles is given by $\rho(\vec{x}_1, \ldots, \vec{x}_N)\, d\vec{x}_1 \ldots d\vec{x}_N$, then the distribution of the first particle is:

$$ \rho_1(\vec{x}_1) = \int_{\Omega_2 \times \ldots \times \Omega_N} \rho(\vec{x}_1, \vec{x}_2, \ldots, \vec{x}_N)\, d\vec{x}_2 \ldots d\vec{x}_N, $$

and the distribution of the first two particles is:

$$ \rho_{12}(\vec{x}_1, \vec{x}_2) = \int_{\Omega_3 \times \ldots \times \Omega_N} \rho(\vec{x}_1, \vec{x}_2, \ldots, \vec{x}_N)\, d\vec{x}_3 \ldots d\vec{x}_N, $$

and so on.

A typical Hamiltonian for such systems can take the form:

$$ H(\vec{x}_1, \ldots, \vec{x}_N) = \sum_{j=1}^N \frac{|p_j|^2}{2m_j} + \sum_{1 \leq j < k \leq N} V_{jk}(q_j - q_k), $$

which combines single-particle Hamiltonians $H_j$ and interaction perturbations. When momenta $p_j$ and masses $m_j$ are normalized to be of size $O(1)$ and the potential $V_{jk}$ has an average value (i.e., $L^1$ norm) of $O(1)$, the first sum has size $O(N)$ and the second sum has size $O(N^2)$. In order to balance the two components and obtain more interesting limiting dynamics as $N \rightarrow \infty$, we introduce a normalization factor of $\frac{1}{N}$ on the right-hand side, resulting in the Hamiltonian:

$$H(\vec{x}_1, \ldots, \vec{x}_N) = \sum_{j=1}^N \frac{|p_j|^2}{2m_j} + \frac{1}{N} \sum_{1 \leq j < k \leq N} V_{jk}(q_j - q_k).$$

Now consider a system of $N$ indistinguishable particles, where all the state spaces $\Omega_1 = \ldots = \Omega_N$ are identical, and observables (including the Hamiltonian) are symmetric functions of the product space $\Omega = \Omega_1^N$ (i.e., invariant under the action of the symmetric group $S_N$). In this case, we can average over the symmetric group (without affecting physical observables) and assume that all mixed states $\rho$ are also symmetric. However, this comes at the cost of mostly giving up pure states $(\vec{x}_1, \ldots, \vec{x}_N)$, as they are symmetric only in exceptional cases where $\vec{x}_1 = \ldots = \vec{x}_N$.

A typical example of a symmetric Hamiltonian is:

$$H(\vec{x}_1, \ldots, \vec{x}_N) = \sum_{j=1}^N \frac{|p_j|^2}{2m} + \frac{1}{N} \sum_{1 \leq j < k \leq N} V(q_j - q_k),$$

where $V$ is an even function (implying all particles have the same individual Hamiltonian and interact with other particles using the same potential). In many physical systems, it is natural to consider short-range interaction potentials, where the interaction between $q_j$ and $q_k$ is localized to the region $q_j - q_k = O(r)$ for some small $r$. This is modeled by Hamiltonians of the form:

$$H (\vec{x}_1, \ldots, \vec{x}_N) = \sum_{j=1}^N H(\vec{x}_j) + \frac{1}{N} \sum_{1 \leq j < k \leq N} \frac{1}{r^d} V\left(\frac{\vec{x}_j - \vec{x}_k}{r}\right),$$

where $d$ is the ambient dimension of each particle (typically 3 in physical models). The factor of $\frac{1}{r^d}$ normalizes the interaction potential's $L^1$ norm to be $O(1)$. An interesting limit arises when $r$ approaches zero as $N$ goes to infinity with $r = N^{-\beta}$, where $\beta$ is some power law. For example, one can consider $N$ particles of "radius" $r$ bouncing around in a box, which is a basic model for classical gases.

A symmetric mixed state example is a factored state:

$$\rho(\vec{x}_1, \ldots, \vec{x}_N) = \rho_1(\vec{x}_1) \ldots \rho_1(\vec{x}_N),$$

where $\rho_1$ is a single-particle probability density function. In the absence of interaction terms in the Hamiltonian, the factored state property is preserved by Hamilton's equations of motion, with $\rho_1$ evolving according to the one-particle equation. However, with interactions, the factored nature may be lost over time.

Old stuff

Symplectic form

See symplectic form.

Hamiltonian vector fields

See Hamiltonian vector fields.

This symplectic form has a main use: to convert the differential of the hamiltonian, $dH$, into a vector field called the symplectic gradient: the only vector field $X_H$ such that $\omega(X_H,-)=dH$. If we have also a Riemannian metric $g$, it turns out that this symplectic gradient and the usual gradient are orthogonal: remember that given a function $H$ the gradient is the only vector field $\nabla H$ such that $dH=g(\nabla H,-)$. Therefore

$$ g(\nabla H, X_H)=dH(X_H)=\omega(X_H,X_H)=0 $$

It's important to observe that:

1. $i_{X_H} \omega=dH$

2. $\mathcal{L}_{X_H} \omega=0$. It is proven by Cartan's formula.

3. $\mathcal{L}_{X_H}\omega^n=0$. This $2n$- form is a volume form. So the flow over the Hamiltonian vector field conserve the volume. This fact is known as Liouville theorem.

More things about:

$T^*Q$ is a space that represents the states of the system. The function $H: T^*Q \mapsto \mathbb{R}$ determines the dynamic of the system. $H$ produces a vector field, $X$, the symplectic gradient, that yields the dynamic itself. Its flow, $\phi$, is again the dynamic.

$X$ is tangent to hypersurfaces $H=cte$:

$$ X(H)=dH(X)=i_X(\omega)(X)=0 $$

(maybe with a minus sign?)

So $H$ is constant along the $\phi$ curves:

$$ X(H)=\frac{d}{dt}H\circ \phi $$

The one-parameter group $\phi$ is the evolution of the system. This privileged parameter $t$ will be called time.

Imagine other function $F:T^*Q\mapsto \mathbb{R}$. Possibly you want to know how it changes with _time_. Fix a initial point $P$:

$$ D=\frac{d}{dt}(F\circ \phi)=X(F)=dF(X) $$

But observe that if we take $Y$ such that $i_Y \omega=-dF$ then

$$ dF(X)=-i_Y \omega (X)=-\omega(Y,X)= $$ $$ =\omega(X,Y)=i_X\omega(Y)=-dH(Y)=-Y(H) $$

So the interesting quantity $D$ can be computed in two ways

$$ X(F)=-Y(H) $$

This motivate a new definition. Remove, temporarily, the special paper to $H$. For the functions $F, H$ we define the Poisson brackets as the new function:

$$ \{F,H\}=X(F)=-Y(H)=\omega(X,Y) $$

It is satisfied:

$\left\{q_{i}, p_{j}\right\}=\delta_{i j}$

Observe that if we leave a slot $\{-,H\}$ we have an operator that acts over functions. In fact, is the vector field $X$! So the Poisson bracket let us to find the symplectic gradient directly, without the use of $d$ and $\omega$. More explicitly

$$ X_f=\{-,f\} $$

From $d\omega=0$ it can be deduced the Jacobi identity:

$$ \{f,\{g, h\}\}=\{\{f, h\}, g\}+\{h,\{f, g\}\} $$

for any three functions $f, g, h$.

Leaving an slot instead of $f$ we get

$$ \{-,\{g,h\}\}=\left[ \{-,g\}, \{-,h\}\right] $$

expression that can be rewritten as

$$ X_{\{g,h\}}=[X_g,X_h]=X_{\omega(X_g,X_h)} $$

Taking the Poisson bracket of any function with a moment $p_i$ gives us how the function varies along the associated coordinate $q_i$:

$$ \{F,p_i\}=X_{p_i}(F) $$

but $X_{p_i}$ is just $\partial q_i$

$\{f,g\}$ can be interpreted as the area of the paralelogram made of $X_f$ and $X_g$

$$ \{f,g\}=X_g(f)=df(X_g)=\nabla f \cdot X_g $$

Any function $f \in \mathcal{C}^{\infty}(T^*Q)$ is called an observable. They constitute a commutative algebra that is, in fact, a Poisson algebra due to the existence of the Poisson bracket $\{-,-\}$. If you fix any $f\in \mathcal{C}^{\infty}(T^*Q)$ then you get a one-parameter local group of transformations for $\mathcal{C}^{\infty}(T^*Q)$ in the following way.

First, you have the vector field $X_f$ given by

$$ X_f=\{-,f\} $$

Then, the flow theorem for vector fields let you assure that there exists the local flow

$$ \phi_s:T^*Q \mapsto T^*Q $$

such that

1. $\phi_0=id$

1. $\frac{d}{ds} \phi_s (m)|_{s=0}=X_f$

1. $\phi_s \circ \phi_t=\phi_{s+t}$

This can be seen also as a flow of observables in CM.

________________________________________

Author of the notes: Antonio J. Pan-Collantes

antonio.pan@uca.es

INDEX: